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ABSTRACT 

Modern radio pulsar surveys produce a large volume of prospective candidates, the 
majority of which are polluted by human-created radio frequency interference or other 
forms of noise. Typically, large numbers of candidates need to be visually inspected in 
order to determine if they are real pulsars. This process can be labor intensive. In this 
paper, we introduce an algorithm called PEACE (Pulsar Evaluation Algorithm for 
Candidate Extraction) which improves the efficiency of identifying pulsar signals. The 
algorithm ranks the candidates based on a score function. Unlike popular machine- 
learning based algorithms, no prior training data sets are required. This algorithm has 
been applied to data from several large-scale radio pulsar surveys. Using the human- 
based ranking results generated by students in the Arecibo Remote Command Center 
programme, the statistical performance of PEACE was evaluated. It was found that 
PEACE ranked 68% of the student-identified pulsars within the top 0.17% of sorted 
candidates, 95% within the top 0.34%, and 100% within the top 3.7%. This clearly 
demonstrates that PEACE significantly increases the pulsar identification rate by a 
factor of about 50 to 1000. To date, PEACE has been directly responsible for the 
discovery of 47 new pulsars, 5 of which are millisecond pulsars that may be useful for 
© 2010 RAS pulsar timing based gravitational-wave detection projects. 
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1 INTRODUCTION 

Radio pulsars are unique celestial objects that are used as 
probes to study a wide range of physics and astrophysics 



(see, for example, [B landford ct al 



l2005l ; lLvne fc Graham-Smithll2006l ~ Studies of pulsar emis- 



19931; IL primer fc Kramer] 



sion have shed light on the properties of the interstellar 
medium and the physics of ultra-relativistic plasmas under 
high magnetic field conditions. The statistical properties of 
the pulsar population give us important information on the 
late stages of stellar evolution, the equation of state of ex- 
otic material, and the formation and evolution of binary 
and multiple-star systems. The stable rotation of radio pul- 
sars allows for unique tests of gravitation theories as well as 
the positive detection of ultra low-frequency gravitational 
waves. In all cases, the greater the number of pulsars that 
are discovered, the more physics and astrophysics we are 
able to study. 

As of 2013, more than 2000 pulsars have bee n found 
(ATNF Pulsar Catalogue, iManchester et al.l 120051 ). Since 
pulsar population models predict that the number of de- 
tectable pulsars in the Galaxy should be about 10 times 
higher than this (assuming a luminosity threshold of 
0.1 m jy kpc 2 , lFaucher-Giguere fc K aspi 20061; iLorimer et all 
2006), several major radio observatories around the world 
are conducting large-scale blind searches for more of these 
objects. Typically, pulsar surveys are performed by point- 
ing the telescope at a region of the sky for several min- 
utes to hours, then moving to another region, and repeating. 
Sophisticat ed analysis software package s, like PRESTO or 
SIGPROC (|Ransomll200ll ; lLorimerll200lf ). are applied to the 
data to search for periodic signals while taking into account 
the effects of dispersion by the interstellar medium as well as 
Doppler shifts due to the binary orbital motion. These search 
algorithms produce a possible series of "candidates" (i.e. 
files or plots containing the identified periodic signals and 
their properties). One can find detailed information about 
pulsar searching techniques from the standard refere nces 
|Lorimer fc Kramerl2005l ; lLvne fc Graham-Smithl2006t). Vi- 



sual inspection, usually aid ed by graphical tools (Keit h et al,l 
120091 ; iFaulkner et al.|[2004l ) , is still required to determine if a 
particular candidate is indeed a pulsar, as opposed to radio- 
frequency interference (RFI) . After inspection, the good can- 
didates are re-observed in order to confirm their astrophys- 
ical origin. 

Surveys produce millions of candidates. As an exam- 
ple, the North High Time Resolution Universe pulsar sur- 
vey (HTRU North), being conducted with the 100-m Ef- 
felsberg radio teles cope, is ex pected to produce 14 million 
pulsar candidates (|Barrl [2oTlh . The multibeam survey cur- 
rently ongoing at the Arecibo Radio Observatory, known 
as the Pulsar Are cibo L-band Feed Array survey (PALFA, 
ICordes et af1l2006h . generates over half a million pulsar can- 
didates per year. The Green Bank Northern Celestial Cap 
pulsar survey (GBNCC), currently being performed at the 
Robert C. Byrd Green Bank Radio telescope, produces can- 
didates at about the same rate. The Green B ank Telescope 
350MHz Drift scan puls ar survey (GBT Drift, iBovles et al.l 
120131 ; iLvnch et al.ll2013h generated over 1.2 million candi- 
dates. Assuming that it takes about one second to inspect a 
candidate by eye, one needs over 250 person-hours to eval- 
uate one million candidates. The number of candidates is 



beyond the ability of a single person. There are two natural 
ways to solve the problem: 1) apply more person power, 2) 
use computer-based methods to reduce the number of can- 
didates needing inspection. 

In an effort to gather the necessary person power and 
increase the rate at which pulsars are identified, the Arecibo 
Remote Command Center program (ARCC), developed at 
the University of Texas at Brownsville (UTB), trains stu- 
dents to search through a large number of candidates from 
these pulsar surveys. In order to increase the rate of pul- 
sar discoveries, the ARCC students have developed a pulsar 
viewing software package known as ARCC Explorer, which 
contains a set of web applications that allows multiple users 
to visually inspect and rank pulsar candidates. Further de- 
tails of the ARCC Explorer will be described elsewhere (Sto- 
vall et al. in preparation), while in this paper we focus on 
one part of the ARCC Explorer that analyzes the candi- 
dates generated by pulsar search pipelines and ranks them 
according to how "pulsar-like" they are. We call this tool 
the Pulsar Evaluation Algorithm for Candidate Extraction 
(PEACE). 

There are currently two major techniques used to re- 
duce the amount of the candidates to inspect. The first type 
is to select suitable candidates base d on several selection 
rules. For examp le, graphical tools (Faul kner et alj |2004| ; 
iKeith et all 120091 ') have been d eveloped to help t he visual 
selection. The second method ([Ke ith ct al. 2009; Ea toughl 
|2009| ; lLeell2009l ; lEatough et al.ll2010] ; lBates et al.|[2012l ) is to 
use computers to automatically select and rank the candi- 
dates. PEACE, which is a tool of the second type, calculates 
a score for each of the candidates, where the score is a mea- 
surement of the degree to which a candidate matches certain 
pulsar-like features. Based on the score from PEACE, the 
ARCC Explorer prioritizes and distributes the candidates 
to the students for evaluation. In this way, the pulsar-like 
candidates are given to the students earlier than the rest of 
the candidates. At the time of writing this paper, PEACE, 
has helped to identify a total of 47 new pulsars in PALFA, 
GBNCC, and HTRU North, five of which are millisecond 
pulsars. Students in the ARCC program at UTB and Uni- 
versity of Wisconsin-Milwaukee (UWM) identified PALFA 
and GBNCC candidates as pulsars, and they were later re- 
observed and confirmed. 

This paper describes the details of the algorithms used 
by PEACE and the techniques used to evaluate its efficiency 
in prioritizing candidates. The details of the pulsars found 
by PEACE will be discussed elsewhere. The algorithms are 
described in Section [2] In Section [3] the current implemen- 
tation is discussed in detail, together with instructions on 
how to obtain and install PEACE. In this section, we also 
discuss the efficiency of the current implementation, which 
is evaluated by comparing its ranking of GBNCC survey 
candidates against the human-based ranking generated by 
ARCC students. We discuss these results and conclusions 
are presented in Section [3] 



2 METHOD TO RANK CANDIDATES 

In this section, we explain the algorithm implemented in 
PEACE to rank candidates. PEACE has two major parts. 
First, it analyzes the candidate files from pulsar search pack- 



© 2010 RAS, MNRAS 000, [HZ] 



PEACE: Pulsar Evaluation Algorithm for Candidate Extraction 3 



ages (e.g. PRESTO and SIGPROC) and, second, calculates 
several statistics such as the signal-to-noise ratio, the pulse 
profile width, etc. We refer to these statistics as quality fac- 
tors. From these quality factors PEACE computes a score, 
which is then used to rank the candidates. We define the 
quality factors and describe how to calculate them in Sec- 
tion [271] The pulsar ranking technique is then presented in 
Section [2721 



2.1 Quality Factors 

Empirical experience has shown that one needs to inspect 
several features of a pulsar candidate in order to properly 
characterize it. In PEACE we have implemented six qual- 
ity factors, which are described below. For further details of 
the implementation, we refer readers to the documents in 
the code repository q These scores inevitably introduce se- 
lection effects in the searching process; we delay the related 
discussions to Section [74] 

(i) The signal-to-noise (S/N) ratio of the folded pulse pro- 
file. 

The S/N is a measure of the significance of the signal 
present in the data. There are various definitions of S/N. 
Here we define the S/N in relation to the pulse profile, where 
the S/N is the ratio between peak and root-mean-square 
(RMS) values. PEACE reads in the pulse profile data, de- 
termines the peak amplitude from the pulse profile. To avoid 
biasing the RMS estimation by outliers, we sort the profile 
data by intensity values and exclude the top 10%. The S/N 
is calculated as the ratio between the amplitude and the 
RMS value. Although such definition of S/N depends on 
the number of bins used to fold the profile, where fewer bins 
give higher S/N, we did not find significant correlation be- 
tween S/N and pulsar period in the GBNCC data set. There 
are other definitions for the S/N e.g. the S/N in terms of 
the mean flux, the reduced x 2 i the standard deviation of 
profile etc. Any of these quantities can be used to quantify 
the strength of pulsed signals, because they contain similar 
information. But one needs to find appropriate score func- 
tions or weights, as we will discuss below. 

(ii) The topocentric period of the source (f>to P ). 

Pulsar search codes are designed to detect a periodic sig- 
nal. As a result, each candidate has an associated signal 
period, the value of which can be indicative of the signal's 
origin. For example, RFI signals due to air traffic control 
radar typically have periods of a few seconds and RFI in- 
duced by power systems have characteristic frequencies of 
50/60 Hz depending on the geographical location of the tele- 
scope. PEACE reads the period directly from the candidate 
file. 

(iii) The width of the pulse profile (w). 

The pulse width, w is defined as the width of the pulse 
normalized by the candidate period. Therefore, w ranges 
from to 1. Typically, one measures the full-width-at-half- 
maximum (FWHM). However, the FWHM is not a robust 
measure of the pulse signal width for our application, which 
may deals with the RFI and signals with low S/N. In order 



to robustly measure the width, we fi rst fit the pulse pro - 
file to multiple Gaussian components l|Kramer et al.lll994l ). 
Overlapping components are combined, w is then calculated 
using an amplitude-weighted sum of the FWHM of each 
component. The width is thus a useful parameter to dis- 
criminate pulsar candidates from RFI, since measured pul- 
sar pulse widths are usually less than 10% llRankinl 1 19831 : 
iLvne fc Manchester! 119881 ; iMaciesiak et alj|201ll ) "and RFIs 
usually result in broad waveforms. Admittedly this breadth 
can be comparable to that seen in millisecond pulsars, how- 
ever. 

(iv) The persistence of the signal in the time domain (??x). 

The persistence of the signal in the time domain is a mea- 
sure of the fraction of the observation in which the candidate 
signal is present. The candidate file usually contains a three 
dimensional data cube, i.e. the signal intensity as the func- 
tion of the time index, the frequency index, and the pulse 
phase index. From the candidate file, PEACE reads in the 
folded pulse profile for each sub-integration (i.e. the pulse 
profile at each time index) and then calculates the on- and 
off-pulse amplitude ratio rx: 



rx 



Average of signal level in the pulse window 
Average of signal level outside the pulse window 



(1) 

where the on-pulse window is defined as that region of pulse 
phase that lies within the FWHM region of each profile com- 
ponent and the off-pulse window covers the remainder. Using 
rx calculated for each sub-integration, PEACE then com- 
putes r/T' 

Number of sub-integrations with rx > ax 
' Total number of sub-integrations 

Here Qx is a preset threshold, whose default value is 1 in 
PEACE. By definition, ryx £ [0, 1]. Since the pulsar signal 
is expected to persist for most of the observing session, true 
pulsar signals should have a high value of 7]t- 

(v) The persistence of the signal in the radio frequency 
domain (??f). 

The persistence of the signal in the radio frequency do- 
main is a measure of that fraction of the bandwidth in which 
the candidate signal is present. Similar to the calculation of 
the rx, PEACE reads in the folded pulse profile for each 
sub-band, then computes the on-and-off pulse amplitude ra- 
tio tf- Vf i s then computed as 



Vf = 



Number of sub-bands with rp > ap 
Total number of sub-bands 



(3) 



where the threshold ap is set to a default value of 1 in 
PEACE. As with r/T, Vf & [0, 1]. Since the pulsar signal is 
expected to be broadband, true pulsar signals should have 
a high value of tjf . 

(vi) The ratio between the pulse width and the DM 
smearing time (ijdm). 

PEACE reads the barycentric period (j»bar), the frequency 
channel width (A/ c ), the center frequency (/), and the dis- 
persion measure (DM) from the candidate file, then calcu- 
lates the dispersive smearing time across a single frequency 
channel: 



At = 8.3 /xs 



( A/c 



The software can be downloaded 
' http: // sourceforge. net /projects /pulsareace / ' . 



from 



VMH2 



/ 



GHz 



DM 
cm _3 pc 



(4) 



Together with the fractional pulse width w, ijdm is calcu- 
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lated as 



t?dm = 



At 



(5) 



Since the pulse width of any true astronomical signal must 
be greater than the dispersive smearing time across a single 
frequency channel, ??dm is expected to be greater than 1 for 
a real pulsar signal. Due to the small difference between the 
Pbar and the pto P , either can be used in the above calculation. 

This simple definition of DM score neglects the effects of 
sampling time. This does not have a significant impact on 
the PEACE ranking, because the sampling time effect only 
increases the measured width w of a pulse profile, which 
increases the ranking score rather than decreasing it. 



GO 



CO 




2.2 Scores 

PEACE combines the measured quality factors and calcu- 
lates a final score, S, which is stored in the database and 
later used by the ARCC Explorer to sort the candidates. 
The final score is defined as a linear combination of individ- 
ual ones, i.e. 



-10 c 



-20 6 



I3s/nS S /n(S/N) + P Ptop S ptap (ptop) 
+Pw S w (w) + Pr, T S VT (ifr 



(6) 



where PsjN, P Ptov , Pw , Pr, T , fi nF , and fi nnu are constants with 
default values of 1. The functions S$/n, 



S VF ,and 



S S/N (S/N) 



-(S/N-5) 2 




S w (w) = < 



S Vt (vt) 



for S/N^ 5, 
for S/N> 5. 

280.7k; 2 + 11.4to - 1.6, 

for w< 0.125, 
-37.9to 2 -4.1u>-4.0, 
for 0.125 ^w< 0.6, 
-20 for 0.6 <w , 

2 



~>n F 



Sr, F (Vf) 



5 ')dm( T /Dm) 




■2) 2 



??DM< 0.4, 

0.4 sj 7] DU ^2, 
2 <r/uM- 



(7) 

(8) 

(9) 
(10) 

(11) 



Each of these functions is shown in Figure [T] The form of 
these functions was inspired by the natural logarithm of the 
probability distribution of the relevant quality factor. For ex- 
ample, the function S w is an analytic approximation to the 
natural logarithm of the pulse width probability distribu- 
tion determined from al l radio pulsars in the ATNF catalog 
(Manchester ct al. 2 0051 ) . The above functions are not tuned 
to a particular survey. However, the function S ptov , which 
characterizes the closeness of the candidate period to any 
RFI signal's value, has to be determined according to the 
local RFI environment of the survey under study. At the 
beginning of a survey, one does not have any information 
about the RFI. In that case, the S pt is chosen to be 0. As 
the number of candidates becomes large, e.g. 10 4 , one can 
start to construct S pt using the following recipe: i) Calcu- 
late the histogram of the periods of all the candidates; ii) 
Use a running-median filter to determine the baseline in the 
histogram; iii) Remove the baseline from the histogram; iv) 



'DM 



2 4 
S/N 



Figure 1. The functions S s /NyS w ,Sr, T , o, ;f ,anu o^ DM 



5 Sj-fY ,3<I1Cl Sri 

PEACE to determine the overall candidate score. 



used by 



Take the negative logarithm of the histogram and re-scale it 
so that it ranges from -10 to 0; v) Interpolate the resulting 
distribution in order to create the analytic continuous func- 
tion 5 ptop . Using data to generate scores is similar to the 
ideas of machine-learning based techniques, where we use 
the data themselves as the training set to generate the RFI 
scores. In the released PEACE package, there is a dedicated 
tool, buildSP, which can generate this function from a list 
of candidate periods. As an example, S ptop for the GBNCC 
survey is shown in Figure [5] 

The use of S Pin „ was inspired by the 'bi rdie-list zap- 
ping technique' (e.g. lLorimer fc Kramerl [20051 ) . where one 
removes all of the candidates with p top in the period range 
where RFI often appearfl Instead of completely removing 
such candidates, PEACE only reduces their final score. If 
other qualities are good, PEACE can still rank such a can- 
didate highly. This allows for the discovery of pulsars with 
periods similar to the local RFI. A particular example is 
shown in Figure [3] although the period of the candidate is 
close to the RFI, PEACE still gives the candidate a high 
final score to make the candidate stand out against the RFI 
because of the other scores. 



3 IMPLEMENTATION AND APPLICATION 
3.1 Implementation 

The source code for PEACE is located at 
http://sourceforge.net/projects/pulsareace/ Currently, 
there exist three versions of PEACE where the only 
difference between these is the type of candidate file it 
analyzes. One version analyzes PRESTO candidate files 
(.pfd), one for pdmp l|Hotan et al.l |2004| ) candidate files 
(.ar), and one for image plots (.png). For .png files, PEACE 



2 There is a major difference between the birdie-list and the pe- 
riod scoring technique. The birdie-list is usually used before the 
harmonic summing, while the S ptop is applied to the final candi- 
dates. 
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Figure 2. S ptap for the GBNCC survey. The x-axis is the 
topocentric period in milliseconds, and the y-axis is the value 
of S vtn „ . 




51 52 53 



Figure 3. The PEACE score and periods for a subset of can- 
didates. The dots are all identified as RFIs, while the diamond 
symbol is the candidate later identified to be a pulsar. Although 
the periods are similar, the PEACE score of the candidate is still 
higher than the scores of RFI. 



scans user-defined regions of the image to determine the 
pulse profile, sub-integrations, and sub-band data. These 
regions can be specified in the command line with the 
pixel coordinates of the left, right, top, and bottom of 
each panel. The idea behind the .png version of PEACE 
is to extend its ability to analyze candidates generated in 
alternative pipelines, where the users measure the geometry 
parameters just once and they can process all the image 
with an identical command line. The values of the candidate 
period and DM are entered via command-line arguments 
upon execution. If these are not given, PEACE ignores 
5 ptop and S VDM , when calculating the score. 

As discussed before, PEACE uses several statisti- 
cal thresholds together with several preset constants (i.e. 
a T ,a F , /3 S /n , /3 Ptop , fiw , Pvt > Pvf > and Pvum ) to calculate the 
score. PEACE allows the user to override each of these pa- 
rameters via command-line arguments. For example, one can 
increase /3 ptop for data with comparatively worse RFI. 



3.2 Evaluation 

ARCC students have visually inspected over fO 5 candidates 
from a part of the GBNCC survey and have identified over 
70 confirmed pulsars (including previous discoveries). This 
data set is complete and un-biased, because all the can- 
didates have been visually inspected at least once. Such a 
unique data set is very valuable to evaluate the effectiveness 
of automatic pulsar ranking systems such as PEACE. 

The ideal candidate sorting algorithm would score all 
real pulsars higher than all other candidates. Therefore, the 
distribution of all the known pulsars in the list of candidates 
ranked by score is a good measure of the effectiveness of 
the ranking algorithm. Here, we used the pulsars, which are 
identified by ARCC students in GBNCC survey, as tracers 
to evaluate the PEACE performance. The measured detec- 
tion rate of PEACE is shown in Figure 01 where 68% of 
confirmed pulsars are in the top 0.17% of PEACE-ranked 
candidates, 95% are in the top 0.34%, and all are in the 
top 3.7%. These results indicate that inspecting candidates 
in order of decreasing PEACE score will significantly in- 
crease the rate of pulsar identification. For this evaluation, 
the statistical thresholds and preset constants were set to 
their default values as described above. 

For most of the time, the processing speed of PEACE 
is limited by the time of reading in files. An Intel® 2.4 GHz 
processor with 6 Mb cache usually processes a candidate file 
within 100 to 500 ms, which corresponds to 0.2 to 1 million 
candidates per processor per day. 



4 DISCUSSIONS AND CONCLUSIONS 

In this paper, we have described PEACE, a software pack- 
age for post-analysis processing of pulsar survey candidates. 
PEACE uses a set of algorithms to analyze a pulsar candi- 
date and calculate a score, which is a measure of how likely 
a candidate is to be a real pulsar. These algorithms are de- 
scribed and the effectiveness of PEACE has been evaluated. 
Using candidates generated by the PEACE survey and in- 
spected by students in the ARCC program, it was shown 
that PEACE significantly increases the rate of identifying 
pulsars. For example, four million candidates requires ap- 
proximately 10 3 person hours in order to visually inspect 
each one. If one pre-sorts these candidates according to their 
PEACE score, 100% of the pulsars are expected to be in the 
top 150 thousand candidates. Inspecting these candidates 
only requires 40 person hours. Such efficiency will, hopefully, 
help the pulsar surveys using future large telescopes, such as 
the Five-hundred-meter Aperture S pherical Radio Telescope 
jNan et alJl2006t ISmits et al.ll200^) and the Square Kil ome- 
tre Array ^Kramer fc Stapper j|20icl ; ISmits et al.1l2009l ). 

PEACE uses six quality factors to determine a candi- 
date's score. These are the signal-to-noise ratio (S/N), the 
candidate period, the pulse width, the signal's persistence 
in the time and the frequency domains, and the pulse width 
to DM smearing time ratio. These quality factors are cho- 
sen because they are readily available from standard pulsar 
searching pipelines (e.g. PRESTO and SIGPROC). Also, 
human experience has shown that these particular quality 
scores are helpful in differentiating between pulsar candi- 
dates and RFI. As shown in Figure [4] there is no single 
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Fraction of candidates 



Figure 4. In order to determine the efficiency of PEACE, candidates are sorted in descending order according to the score generated by 
PEACE. In the above figure, the fraction of top ranked candidates is plotted versus the fraction of confirmed pulsars in those candidates. 
The thick solid, dashed, dot-dashed, dotted, solid, circled, and cross-marked curves are for the ranking using the score S, S§/Ni S w , Sdm> 
S ptop , S VT , and S VF , respectively. We can see that using the final score (S) is better than using the individual ones. The ranking using 
the final score S puts 68% of confirmed pulsars in the top 0.17% of ranked candidates, 95% are in the top 0.34%, and all are in the top 
3.7%. 



quality factor dominating the final score. For example, typi- 
cal survey analyses will remove candidates whose periods lie 
within RFI-dominated regions (e.g. the 'birdie-list zapping 
technique'). PEACE simply reduces the score of such can- 
didates. Thus, it is still possible to find pulsars within the 
RFI-dominated regions. 

Beside the six quality factors used in this paper , 
there are other possi bilities. For example. lEatou gh (2009); 
lEatough et~ai1 (|2010l ') find that the \ 2 of fit to the theo- 
retical DM-S/N curve and other factors can be useful for 
neural network algorithms in identifying pulsars. From our 
experience with PALFA and HTRU North surveys, the score 
function for the DM-S/JV curve fitting \ 2 wm n °t be a sim- 
ple shape. In this way, a more complex scoring scheme with 
more quality factors may further improve the current per- 
formance of PEACE. 

PEACE is fully pre-determined and does not re- 
quire any training data sets. This is diffe rent from other 
approaches that use neural netwo rks l|Eatoughl 120091 ; 
lEatough et alJlioiol ; iBates et eJ.Jl2012h , which determine the 
strategy for ranking candidates by 'learning' knowledge from 
training data sets. Although PEACE does not require such 
initial training data sets, it can be further fine-tuned when 
such data become available. 



As a caveat, using PEACE introduces selection effects 
in the searching. For machine-learning algorithms, it is hard 
to quantify the selection effects, since they are inherited from 
the training data sets. For PEACE we know exactly what 
the selection effects are and the users can adjust the score 
weights to adapt to particular purposes. As indicated in Fig- 
ure [T] PEACE prefers candidates with small pulse widths, 
high S/N, wide-band signals, and persistent pulses. It also 
down-weights the candidates with pulse profiles narrower 
than the channel DM smearing widths. Using multiple scores 
reduces the chance of missing good candidates to a certain 
degree, although it may still give low scores for the can- 
didates with wide pulse profiles and low S/N. For certain 
pulsars, the pulse energy can decrease by a factor of ten 
or more over a sh ort timescale and then increase just as 
sharply afterwards (|Backeilll97(ih . Such 'nulling' pulsar may 
get a lower score due to the persistence score t/t- Similarly, 
in the radio frequency do main, signals can scintillate due to 
the interstellar medium (Rickctt 1990), which reduces the 
score 7/f. 

PEACE can be a starting point for machine-learning 
algorithms. Since we have demonstrated that the quality 
factors calculated by PEACE effectively quantify how likely 
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a candidate is of being a pulsar, more advanced algorithms 
may further improve PEACE'S scoring performance. 
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